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Abstract. This study identifies drivers of deforestation in Mexico by apply- 
ing Geographically Weighted Regression (GWR) models to cartographic 
and statistical data. A wall-to-wall multitemporal GIS database was con- 
structed incorporating digital land use/land cover maps for 2002 and 2007; 
along with ancillary data (road network, settlements, topography and socio- 
economical parameters). The database analysis allowed assessing the spa- 
tial distribution of forests and deforestation at the municipal level. The sta- 
tistical analysis of deforestation drivers presented here was focused on the 
proportion of anthropogenic cover in 2007 as dependent variable. I n com- 
parison with the global model, the use of GWR increased the strength in the 
relationship in terms of the goodness-of-fit (adjusted R 2 ) from 0.69 (global 
model) to 0.72 (average R 2 of GWR local models), with individual GWR 
models ranging from 0.48 to 0.81 The GWR model highlighted the spatial 
variation of the relationshi p between the percentage of anthropogenic cover 
and its drivers. Factors identified as having a major impact on deforestation 
were related to topography (slope), accessibility (road and settlement densi- 
ty) and marginalization. Results indicate that some of the drivers explaining 
deforestati on vary over space, and that the same driver can exhi bit opposite 
effects depending on the region. Based on local regression model coeffi- 
cients, a cluster analysis allowed the aggregation of municipalities with 
similar patterns of deforestation into homogeneous regions. A deforestation 
model for the entire country will be developed further, using these regions 
to divide the model procedures into sub-regions with specific deforestation 
patterns. 
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1. Introduction 



Mexico, with a total area a total area of about 2 million square kilometers, is 
a megadiverse country, but it presents high rates of deforestation (FAO 
2001). Various studies have attempted to assess land use / cover change 
(LUCC) over the last decades (Mas et al. 2004) but there have been few at- 
tempts to assess the mai n causes of def orestati on at nati onal I evel ( F i gueroa 
et al. 2009, Pineda J aimes et al 2010, Bonilla-Moheno et al. 2013). Geo- 
graphically Weighted Regression (GWR) have been applied in exploring 
spatial data in the social, health and environmental sciences. The goal of 
this study is to evaluate the spatial patterns of deforestation with respect to 
drivers reported to influence LUCC using Geographically Weighted Regres- 
sion (GWR). I n this paper, we present the prel i mi nary results. 



2. Material and Methods 

2.1. Material 

I n order to elaborate the Gl S database, thefollowi ng data were used: 

•Maps of land use/ cover (LUC) at 1250,000 scale from the Nati onal Insti- 
tute of Geography, Statistics and Informatics (INEGI for its Spanish acro- 
nym) for 2002 and 2007. These maps are compatible with regards to scale 
and classification scheme (M as et al. 2004). 

• Maps of ancillary data (digital elevation model, roads maps, human set- 
tlements, municipal boundaries). 

• Socio-economic data from the INEGI organized by municipality (Popula- 
tion census for 2000, 2005 and 2010). 

GIS operations were carried out with the following programs: ArcGIS 
(ESRI, Redlands, CA) and Q-GIS (www.qgis.org/). Statistical analysis and 
graphs were created using R (R Development Core Team 2009). Geograph- 
ically weighted regressions (GWR) were carried out using the packages: 
gwrr (Wheeler 2007 and 2012) and spgwr (Bivand and Yu 2012) in R. 

2.2. LUCC Monitoring and GIS database elaboration 

LUCC monitoring was done by overlaying the LUC maps of 2002 and 2007 
and the map of municipalities. Areas of change were tabulated and comput- 
ed as well as the rate of deforestation. Inthisstudy, the rate of deforestation 
reflects the transformation of wooded covers (temperate and tropical for- 
ests, scrublands) into anthropogenic covers (agriculture, pasture land, ur- 
ban areas). Two dependent variables were generated for each municipality: 
the proportion of area covered by anthropogenic covers in 2007 as a proxy 



for the I eve! of anthropization and the rate of deforestation between 2004 
and 2007. The rate of deforestation were obtained from municipalities with 
a forested area covering at least 500 ha and 10% of the municipality in or- 
der to avoid outlier values. More than 2106 municipalities out of 2456 ful- 
filled these conditions, representing more than 97.2 % of the country total 
area and 99. 8% of the forest and scrubland area. 

In order to determine which ancillary variables are most likely to be indirect 
drivers of deforestation, we calculated, for each municipality various indices 
describing: population, land tenure, economic activities and the resources 
accessibility. These indices were: a) Population density in 2010 (people per 
km 2 ); b) Density settlements (number of settlements per km 2 ); c) I ndex of 
marginalization (CONAPO 2010), d) Cattle density, e) Goat density, f) 
Mean slope (degrees), g) Road density (km of road per km 2 ), h) Proportion 
of municipality area with private tenure, i) Proportion of area with ejidal 
tenure, and j) Proportion of area with communal tenure. Ejidal and Com- 
munal tenures are both common-pool systems, related to land redistribu- 
tion process and land title restitution respectively. 

2.3. Statistical Analysis 

Geographically Weighted Regression is a local spatial statistical technique 
for exploring spatial nonstationarity (Fotheringham et al., 2002). It sup- 
ports locally modeling of spatial relationships by fitting regression models. 
Regression parameters are estimated using a weighting function based on 
distance in order to assign larger weights to closer locations. Different from 
the usual global regression, which produces a single regression equation by 
summarizing the overall relationships among the explanatory and depend- 
ent variables (for the whole Mexican territory in that case), GWR produces 
spatial data that express the spatial variation in the relationships among 
variables. Maps that present the spatial distribution of the regression coef- 
ficient estimates along with the level of significance (eg. t- values) have an 
essential role in exploring and interpreting spatial nonstationarity. Fother- 
ingham et al. (2002) provide with a full description of GWR, and Mennis 
(2006) gives useful suggestions to map GWR results. 

The first stage of the study was correlation analysis between explanatory 
variables using the Spearman coefficient in order to discard highly correlat- 
ed variables. Due to the uneven distribution and size of the municipalities, 
the weighting function was based on a proportion of the observations (k- 
nearest neighbors) assigned to the centroid of each municipality. The opti- 
mal size of the bandwidth was evaluated by minimizing the root mean 
square error. A map was elaborated for each explanatory variable showing 
the value of the regression's coefficients (color scaling of the centroid sym- 



bol) and statistical significance (size of thecentroid symbol). Next, a cluster 
analysis was carried out assorting municipalities with common deforesta- 
tion patterns (similar regression coefficients) into regions. Finally, a global 
regression model was applied to each region. 



3. Results 

3.1. LUCC Monitoring 

According to I NEGI maps, anthropogenic covers (mainly crop and pasture 
lands) occupy about 26% of the Mexican territory. During 2002-2007, the 
anthropogenic cover area increased (14,600 km 2 ), while tropical forest and 
scrubland area decreased (reduction of 8400 and 5200 km 2 respectively). 
The area of temperate forest remains broadly the same. As shown in Figure 
lthe Anthropization index varies over space: In the coastal floodplains of 
the Gulf of Mexico and the center of Mexico, original vegetation cover has 
been almost totally removed while other regions are better conserved. 
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Figure! Per munici pa I ty Anthropization index (2007) 



3.2. Geographically Weighted Regression (GWR) 

In this paper, we report only the results of the GWR using as dependent 
variablethe proportion of anthropogenic cover (anthropization index). The 
weighting function was based on a 5% of the observations. A global model 
was fitted and obtained an adjusted-R 2 of 0.69. The use of GWR slightly 
increased the strength in the relationship in terms of the goodness-of-fit 
(adjusted R 2 ) to 0.72 (average R 2 of GWR local models), with local GWR 
models with adjusted R 2 ranging from 0.48 to 0.81 Figure 2 presents the 
spatial distribution of the goodness of fit. 
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Figure 2. Distribution of local R 2 . 

Some variables such as population and road density and slope exhibit a 
significant relationship with the proportion of anthropogenic cover for the 
whole territory. As expected, thefirst two variables havea positive effect on 
this proportion whileslope presents a negative relationship (Figure 3). Oth- 
er explanatory variables have a more contrasted patterns. I n example, the 
marginalization index presents a significant relationship with the propor- 
tion of anthropogenic cover only for about half of the territory. It presents a 
positive relationship in the Baja California states, the border with the USA 



and the north strip along the Gulf of Mexico and a negative relationship in 
vari ous regi ons of M exi co Southeastern part ( F i gure 3) . 

The region where the relationship is positive is related to industrialized 
regions where more developed municipalities present both higher propor- 
tion of anthropogenic cover and lower marginalization index. Opposite to 
that, in the southeastern region of the country more conserved areas are 
associated to higher levels of marginalization. Previous researches have 
reported that the most conserved natural areas in Mexico are often located 
in poor rural areas and/ or community lands (Klooster 2000; Alix-Garcia et 
al. 2005, Figueroaetal. 2009, Garcia-Barriosetal. 2009). 




Figure 3. GWR coefficient and significance values for road density. 

The cluster analysis helped dividing the whole territory into five principal 
regions (Figure 5). It is worth noting that some of these regions have a high 
coincidence with other territory zoning such as physiographic map (I NEGI 
1981). Table 1 presents the variables retained in each regional regression 
model, along with thesign and the significance level of their coefficient. 




Figure 4. GWR coefficient and significance values for marginalization. 
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Table L Sign and significance of coefficients region-based regression. Significance 
codes: *** 0.001**, 0.01*, 0.05 . 




Figure 5. Regions based on the cluster analysis of GWR coefficients. 



4. Discussion and Conclusion 

Some I imitations of this study have been identified: First, it is based only on 
a drastic change of land cover (forested cover versus anthropogenic cover), 
it does not consider cover degradation. This factor has to be considered 
during the results interpretation. For example in some regions goat density 
is associated with lower levels of anthropization, however it is likely related 
with vegetation cover degradation considered in this study as natural (with- 
out any indication about degradation level). Second, we used contemporary 
explanatory variables (eg. 2005 and 2010 population census) to explain 
anthropization patterns resulting from centuries of landscape transfor- 
mation. In fact, it is likely that the effect of a driver on a given region is re- 
lated to the time such driver has been shaping the landscape. Finally, as 
depicted in figure 2, the set of explanatory variables we used did not allow 
to explain the dependent variable in a satisfactory manner for the entire 
territory. Moredrivers have to betaken into account for future analysis. 



However, results dearly show the advantages of a local approach (GWR) 
over a global one, to assess different drivers' effect on LUCC over such a 
complex and diverse territory as Mexico. In future researches, explanatory 
vari abl es wi 1 1 be i ntegrated i nto the model to test the effect of mi grati on and 
public policies. Considering the rate of deforestation during different past 
periods of time (instead of thelevel of anthropization) will enableusto ana- 
lyze the dynamic of deforestation in its temporal and spatial dimensions. A 
deforestation model for the entire country will be developed, using the sug- 
gested regions (figure 5) dividing the model procedures into sub-regions 
with specific deforestation patterns. Spitting the model into sub-regions is 
expected to i mprove the prospecti ve map accuracy produced by the model . 
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